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1 SEQUENCE LISTING 
2 

3 (1) General Information: 

5 (i) APPLICANTS: Nicholas J. Deacon £ /Vf "7* fl^ f% 

6 Jennifer C. Learmont ' 0^* O ^ 

7 Dale A. McPhee ^ ' **** 

8 Suzanne Crowe 

9 David Cooper 
10 

11 (ii) TITLE OF INVENTION: NON- PATHOGENIC STRAINS OF HIV-1 
12 

13 (iii) NUMBER OF SEQUENCES: 841 
14 

15 (iv) CORRESPONDENCE ADDRESS: 

16 (A) ADDRESSEE: SCULLY, SCOTT, MURPHY S. PRESSER 

17 (B) STREET: 400 GARDEN CITY PLAZA 

18 (C) CITY: GARDEN CITY 

19 (D) STATE: NEW YORK 

20 (E) COUNTRY: U.S.A. 

21 (F) ZIP: 11530-0299 
22 

23 (V) COMPUTER READABLE FORM: 

24 (A) MEDIUM TYPE: Floppy disk 

25 (B) COMPUTER: IBM PC compatible 

26 (C) OPERATING SYSTEM: PC -DOS/MS- DOS 

27 (D) SOFTWARE: Patentln Release #1.0, Version #1.25 
28 

29 (vi) CURRENT APPLICATION DATA: 

30 (A) APPLICATION NUMBER: 09/146,783 

31 (B) FILING DATE: 
32 

33 (Vii) PRIOR APPLICATION DATA: 

34 (A) APPLICATION NUMBER: 08/477 , 464 

35 (B) FILING DATE: 07-JUN-1995 

36 (A) APPLICATION NUMBER: PM3864 (AU) 

37 (B) FILING DATE: 14-FEB-1994 

38 (A) APPLICATION NUMBER: PM4002 (AU) 

39 (B) FILING DATE: 21-FEB-1994 

40 (A) APPLICATION NUMBER: PN0284 (AU) 

41 (B) FILING DATE: 23-DEC-1994 
42 

4 3 (viii) ATTORNEY/ AGENT INFORMATION: 

44 (A) NAME: FRANK S. DIGIGLIO 

45 (C) REFERENCE/DOCKET NUMBER: 9606Z-I 
46 
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47 (ix) TELECOMMUNICATION INFORMATION: 

48 (A) TELEPHONE: (516) 742-4343 

49 (B) TELEFAX: (516) 742-4366 
50 

51 

52 (2) INFORMATION FOR SEQ ID NO:l: 
53 

54 (i) SEQUENCE CHARACTERISTICS: 3 

55 (A) LENGTH: 9709 base pairs 

56 (B) TYPE: nucleic acid 

57 (C) STRANDEDNESS : single 

58 (D) TOPOLOGY: linear 
59 

60 (ii) MOLECULE TYPE: DNA 

61 

62 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

63 

64 

65 TGGAAGGGCT AATTTGGTCC CAAAAAAGAC AAGAGATCCT TGATCTGTGG ATCTACCACA 60 
66 

67 CACAAGGCTA CTTCCCTGAT TGGCAGAACT ACACACCAGG GCCAGGGATC AGATATCCAC 120 
68 

69 TGACCTTTGG ATGGTGCTTC AAGTTAGTAC CAGTTGAACC AGAGCAAGTA GAAGAGGCCA 180 
70 

71 AATAAGGAGA GAAGAACAGC TTGTTACACC CTATGAGCCA GCATGGGATG GAGGACCCGG 240 
72 

7 3 AGGGAGAAGT ATTAGTGTGG AAGTTTGACA GCCTCCTAGC ATTTCGTCAC ATGGCCCGAG 300 
74 

75 AGCTGCATCC GGAGTACTAC AAAGACTGCT GACATCGAGC TTTCTACAAG GGACTTTCCG 360 
76 

77 CTGGGGACTT TCCAGGGAGG TGTGGCCTGG GCGGGACTGG GGAGTGGCGA GCCCTCAGAT 4 20 

78 

7 9 GCTACATATA AGCAGCTGCT TTTTGCCTGT ACTGGGTCTC TCTGGTTAGA CCAGATCTGA 480 
80 

81 GCCTGGGAGC TCTCTGGCTA ACTAGGGAAC CCACTGCTTA AGCCTCAATA AAGCTTGCCT 540 
82 

83 TGAGTGCTCA AAGTAGTGTG TGCCCGTCTG TTGTGTGACT CTGGTAACTA GAGATCCCTC 600 
84 

85 AGACCCTTTT AGTCAGTGTG GAAAATCTCT AGCAGTGGCG CCCGAACAGG GACTTGAAAG 660 
86 

87 CGAAAGTAAA GCCAGAGGAG ATCTCTCGAC GCAGGACTCG GCTTGCTGAA GCGCGCACGG 720 
88 

89 CAAGAGGCGA GGGGCGGCGA CTGGTGAGTA CGCCAAAAAT TTTGACTAGC GGAGGCTAGA 780 
90 

91 AGGAGAGAGA TGGGTGCGAG AGCGTCGGTA TTAAGCGGGG GAGAATTAGA TAAATGGGAA 840 
92 

93 AAAATTCGGT TAAGGCCAGG GGGAAAGAAA CAATATAAAC TAAAACATAT AGTATGGGCA 900 
94 

95 AGCAGGGAGC TAGAACGATT CGCAGTTAAT CCTGGCCTTT TAGAGACATC AGAAGGCTGT 960 
96 

97 AGACAAATAC TGGGACAGCT ACAACCATCC CTTCAGACAG GATCAGAAGA ACTTAGATCA 1020 
98 

99 TTATATAATA CAATAGCAGT CCTCTATTGT GTGCATCAAA GGATAGATGT AAAAGACACC 1080 
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100 

101 AAGGAAGCCT TAGATAAGAT AGAGGAAGAG CAAAACAAAA GTAAGAAAAA GGCACAGCAA 1140 
102 

103 GCAGCAGCTG ACACAGGAAA CAACAGCCAG GTCAGCCAAA ATTACCCTAT AGTGCAGAAC 1200 
104 

105 CTCCAGGGGC AAATGGTACA TCAGGCCATA TCACCTAGAA CTTTAAATGC ATGGGTAAAA 1260 
106 

107 GTAGTAGAAG AGAAGGCTTT CAGCCCAGAA GTAATACCCA TGTTTTCAGC ATTATCAGAA 1320 
108 

109 GGAGCCACCC CACAAGATTT AAATACCATG CTAAACACAG TGGGGGGACA TCAAGCAGCC 1380 
110 

111 ATGCAAATGT TAAAAGAGAC CATCAATGAG GAAGCTGCAG AATGGGATAG ATTGCATCCA 1440 
112 

113 GTGCATGCAG GGCCTATTGC ACCAGGCCAG ATGAGAGAAC CAAGGGGAAG TGACATAGCA 1500 
114 

115 GGAACTACTA GTACCCTTCA GGAACAAATA GGATGGATGA CACATAATCC ACCTATCCCA 1560 
116 

117 GTAGGAGAAA TCTATAAAAG ATGGATAATC CTGGGATTAA ATAAAATAGT AAGAATGTAT 1620 
118 

119 AGCCCTACCA GCATTCTGGA CATAAGACAA GGACCAAAGG AACCCTTTAG AGACTATGTA 1680 
120 

121 GACCGATTCT ATAAAACTCT AAGAGCCGAG CAAGCTTCAC AAGAGGTAAA AAATTGGATG 1740 

122 ACAGAAACCT TGTTGGTCCA AAATGCGAAC CCAGATTGTA AGACTATTTT AAAAGCATTG 1800 
123 

124 GGACCAGGAG CGACACTAGA AGAAATGATG ACAGCATGTC AGGGAGTGGG GGGACCCGGC 1860 
125 

126 CATAAAGCAA GAGTTTTGGC TGAAGCAATG AGCCAAGTAA CAAATCCAGC TACCATAATG 1920 
127 

128 ATACAGAAAG GCAATTTTAG GAACCAAAGA AAGACTGTTA AGTGTTTCAA TTGTGGCAAA 1980 
129 

130 GAAGGGCACA TAGCCAAAAA TTGCAGGGCC CCTAGGAAAA AGGGCTGTTG GAAATGTGGA 2040 
131 

132 AAGGAAGGAC ACCAAATGAA AGATTGTACT GAGAGACAGG CTAATTTTTT AGGGAAGATC 2100 
133 

134 TGGCCTTCCC ACAAGGGAAG GCCAGGGAAT TTTCTTCAGA GCAGACCAGA GCCAACAGCC 2160 
135 

136 CCACCAGAAG AGAGCTTCAG GTTTGGGGAA GAGACAACAA CTCCCTCTCA GAAGCAGGAG 2 220 
137 

138 CCGATAGACA AGGAACTGTA TCCTTTAGCT TCCCTCAGAT CACTCTTTGG CAGCGACCCC 2 280 
139 

140 TCGTCACAAT AAAGATAGGG GGGCAATTAA AGGAAGCTCT ATTAGATACA GGAGCAGATG 2 340 
141 

142 ATACAGTATT AGAAGAAATG AATTTGCCAG GAAGATGGAA ACCAAAAATG ATAGGGGGAA 2400 
143 

144 TTGGAGGTTT TATCAAAGTA GGACAGTATG ATCAGATACT CATAGAAATC TGCGGACATA 2460 
145 

146 AAGCTATAGG TACAGTATTA GTAGGACCTA CACCTGTCAA CATAATTGGA AGAAATCTGT 2520 
147 

148 TGACTCAGAT TGGCTGCACT TTAAATTTTC CCATTAGTCC TATTGAGACT GTACCAGTAA 2580 
149 

150 AATTAAAGCC AGGAATGGAT GGCCCAAAAG TTAAACAATG GCCATTGACA GAAGAAAAAA 2640 
151 

152 TAAAAGCATT AGTAGAAATT TGTACAGAAA TGGAAAAGGA AGGAAAAATT TCAAAAATTG 2700 
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153 

154 GGCCTGAAAA TCCATACAAT ACTCCAGTAT TTGCCATAAA GAAAAAAGAC AGTACTAAAT 2760 
155 

156 GGAGAAAATT AGTAGATTTC AGAGAACTTA ATAAGAGAAC TCAAGATTTC TGGGAAGTTC 2820 
157 

158 AATTAGGAAT ACCACATCCT GCAGGGTTAA AACAGAAAAA ATCAGTAACA GTACTGGATG 2880 
159 

160 TGGGCGATGC ATATTTTTCA GTTCCCTTAG ATAAAGACTT CAGGAAGTAT ACTGCATTTA 2940 
161 

162 CCATACCTAG TATAAACAAT GAGACACCAG GG ATT AG ATA TCAGTACAAT GTGCTTCCAC 3000 
163 

164 AGGGATGGAA AGGATCACCA GCAATATTCC AGTGTAGCAT GACAAAAATC TTAGAGCCTT 3060 
165 

166 TTAGAAAACA AAATCCAGAC ATAGTCATCT ATCAATACAT GGATGATTTG TATGTAGGAT 3120 
167 

168 CTGACTTAGA AATAGGGCAG CATAGAACAA AAATAGAGGA ACTGAGACAA CATCTGTTGA 3180 
169 

170 GGTGGGGATT TACCACACCA GACAAAAAAC ATCAGAAAGA ACCTCCATTC CTTTGGATGG 3240 
171 

172 GTTATGAACT CCATCCTGAT AAATGGACAG TACAGCCTAT AGTGCTGCCA GAAAAGGACA 3 300 
173 

174 GCTGGACTGT CAATGACATA CAGAAATTAG TGGGAAAATT GAATTGGGCA AGTCAGATTT 3 360 
175 

176 ATGCAGGGAT TAAAGTAAGG CAATTATGTA AACTTCTTAG GGGAACCAAA GCACTAACAG 3420 
177 

178 AAGTAGTACC ACTAACAGAA GAAGCAGAGC TAGAACTGGC AGAAAACAGG GAGATTCTAA 3480 
179 

180 AAGAACCGGT ACATGGAGTG TATTATGACC CATCAAAAGA CTTAATAGCA GAAATACAGA 3540 
181 

182 AGCAGGGGCA AGGCCAATGG ACATATCAAA TTTATCAAGA GCCATTTAAA AATCTGAAAA 3600 
183 

184 CAGGAAAATA TGCAAGAATG AAGGGTGCCC ACACTAATGA TGTGAAACAA TTAACAGAGG 3660 
185 

186 CAGTACAAAA AATAGCCACA GAAAGCATAG TAATATGGGG AAAGACTCCT AAATTTAAAT 37 20 
187 

188 TACCCATACA AAAGGAAACA TGGGAAGCAT GGTGGACAGA GTATTGGCAA GCCACCTGGA 3780 
189 

190 TTCCTGAGTG GGAGTTTGTC AATACCCCTC CCTTAGTGAA GTTATGGTAC CAGTTAGAGA 3840 
191 

192 AAGAACCCAT AATAGGAGCA GAAACTTTCT ATGTAGATGG GGCAGCCAAT AGGGAAACTA 3900 
193 

194 AATTAGGAAA AG C AG GAT AT GTAACTGACA GAGGAAGACA AAAAGTTGTC CCCCTAACGG 3960 
195 

196 ACACAACAAA TCAGAAGACT GAGTTACAAG CAATTCATCT AGCTTTGCAG GATTCGGGAT 40 20 
197 

198 TAGAAGTAAA CATAGTGACA GACTCACAAT ATGCATTGGG AATCATTCAA GCACAACCAG 4080 
199 

200 ATAAGAGTGA ATCAGAGTTA GTCAGTCAAA TAATAGAGCA GTTAATAAAA AAGGAAAAAG 4140 
201 

202 TCTACCTGGC ATGGGTACCA GCACACAAAG GAATTGGAGG AAATGAACAA GTAGATGGGT 4200 
203 

204 TGGTCAGTGC TGGAATCAGG AAAGTACTAT TTTTAGATGG AATAGATAAG GCCCAAGAAG 4260 
205 
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206 AACATGAGAA ATATCACAGT AATTGGAGAG CAATGGCTAG TGATTTTAAC CTACCACCTG 4 320 
207 

208 TAGTAGCAAA AGAAATAGTA GCCAGCTGTG ATAAATGTCA GCTAAAAGGG GAAGCCATGC 4 380 
209 

210 ATGGACAAGT AGACTGTAGC CCAGGAATAT GGCAGCTAGA TTGTACACAT TTAGAAGGAA 4440 
211 

212 AAGTTATCTT GGTAGCAGTT CATGTAGCCA GTGGATATAT AGAAGCAGAA GTAATTCCAG 4500 
213 

214 CAGAGACAGG GCAAGAAACA GCATACTTCC TCTTAAAATT AGCAGGAAGA TGGCCAGTAA 4560 
215 

216 AAACAGTACA TACAGACAAT GGCAGCAATT TCACCAGTAC TACAGTTAAG GCCGCCTGTT 4620 
217 

218 GGTGGGCGGG GATCAAGCAG GAATTTGGCA TTCCCTACAA TCCCCAAAGT CAAGGAGTAA 4680 
219 

220 TAGAATCTAT GAATAAAGAA TTAAAGAAAA TTATAGGACA GGTAAGAGAT CAGGCTGAAC 4740 
221 

222 ATCTTAAGAC AGCAGTACAA ATGGCAGTAT TCATCCACAA TTTTAAAAGA AAAGGGGGGA 4800 
223 

224 TTGGGGGGTA CAGTGCAGGG GAAAGAATAG TAGACATAAT AGCAACAGAC ATACAAACTA 4860 
225 

226 AAGAATTACA AAAACAAATT ACAAAAATTC AAAATTTTCG GGTTTATTAC AGGGACAGCA 4 920 
227 

228 GAGATCCAGT TTGGAAAGGA CCAGCAAAGC TCCTCTGGAA AGGTGAAGGG GCAGTAGTAA 4980 
229 

2 30 TACAAGATAA TAGTGACATA AAAGTAGTGC CAAGAAGAAA AGCAAAGATC ATCAGGGATT 5040 
231 

2 32 ATGGAAAACA GATGGCAGGT GATGATTGTG TGGCAAGTAG ACAGGATGAG GATTAACACA 5100 
233 

2 34 TGGAAAAGAT TAGTAAAACA CCATATGTAT ATTTCAAGGA AAGCTAAGGA CTGGTTTTAT 5160 
235 

2 36 AGACATCACT ATGAAAGTAC TAATCCAAAA ATAAGTTCAG AAGTACACAT CCCACTAGGG 5220 
237 

2 38 GATGCTAAAT TAGTAATAAC AACATATTGG GGTCTGCATA CAGGAGAAAG AGACTGGCAT 5280 
239 

240 TTGGGTCAGG GAGTCTCCAT AGAATGGAGG AAAAAGAGAT ATAGCACACA AGTAGACCCT 5 340 
241 

24 2 GACCTAGCAG ACCAACTAAT TCATCTGCAC TATTTTGATT GTTTTTCAGA ATCTGCTATA 5400 
243 

244 AGAAATACCA TATTAGGACG TATAGTTAGT CCTAGGTGTG AATATCAAGC AGGACATAAC 5460 
245 

246 AAGGTAGGAT CTCTACAGTA CTTGGCACTA GCAGCATTAA TAAAACCAAA ACAGATAAAG 5520 
247 

248 CCACCTTTGC CTAGTGTTAG GAAACTGACA GAGGACAGAT GGAACAAGCC CCAGAAGACC 5580 
249 

250 AAGGGCCACA GAGGGAGCCA TACAATGAAT GGACACTAGA GCTTTTAGAG GAACTTAAGA 5640 
251 

252 GTGAAGCTGT TAGACATTTT CCTAGGATAT GGCTCCATAA CTTAGGACAA CATATCTATG 5700 
253 

254 AAACTTACGG GGATACTTGG GCAGGAGTGG AAGCCATAAT AAGAATTCTG CAACAACTGC 5760 
255 

256 TGTTTATCCA TTTCAGAATT GGGTGTCGAC ATAGCAGAAT AGGCGTTACT CGACAGAGGA 5820 
257 

258 GAGCAAGAAA TGGAGCCAGT AGATCCTAGA CTAGAGCCCT GGAAGCATCC AGGAAGTCAG 5880 
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